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AMENDMENTS TO THE CLAIMS 

This listing of claims replaces all prior versions, and listings, of claims in the 
application. 

1. (Original) A method for converting a digitized melody into a sequence of 
notes, comprising: 

segmenting said melody into a series of frames; 

computing a spectral energy distribution (SED) indicator for each frame; and 
estimating initial breakpoints in said melody based on said SED indicator, said 
notes being defined between adjacent initial breakpoints. 

2. (Currently Amended) -A The method according to claim 1, wherein=4fee 
value of said SED indicator has a value, the value for a given frame that is relatively 
large if an energy distribution associated with said frame is concentrated in at least 
on e or more- specified frequenc y banda band . 

3. (Currently Amended) =A The method according to claim 2, including 
filtering said melody with a high pass filter prior to segmenting said melody into said 
frames. 

4. (Currently Amended) -A The method according to claim ?, therein 
including determining said energy distribution io determined from a normalized 
energy spectrum of said frame. 

5. (Currently Amended) ~A The method according to claim 3, wherein said 
specified frequency band is the upper portion of a 0 to 4 kHz range. 
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6. (Currently Amended) ^ The method according to claim 3, wherein4he said 
SED indicator is defined as 

Zf(k) g (X(k)) 

Zg(X(k)) ' WhCre X( - k) iS the ener §y spectrum of a frame at frequency bin k and f(k) 

k 

and g(X(k)) are non-negative and non-decreasing functions of k and X(k), 
respectively. 

7. (Currently Amended) ^ The method according to claim 6, wherein=*e said 
SED indicator is defined as 

ZkX(k) 

k 

£X(k) ' 

k 

8. (Currently Amended) -A The method according to claim 6, wherehrtfee said 
SED indicator is defined as 

k 

1.X {k) 

k 



9. (Currently Amended) =4 The method according to claim 6, wherein=*e said 
SED indicator is defined as 
Zk 2 X(k) 

k 



Zx(k) 



10. (Currently Amended) -A The method according to claim 6, wherein-the 
said SED indicator is defined as 

, where K is the frequency bin corresponding to the Nyquest frequency. 
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1 1 . (Currently Amended) =A The method according to claim 6 ? wherein4fee 
said SED indicator is defined as 

ZkX(k) 2 

k 

ZX(k) 2 ' 

k 

1 2. (Currently Amended) =A The method according to claim 3, wherein the 
including compiling auto-correlation of each=ssid fram e ia computed and computing 
said SED indicato r is computed by estimating=tfee slope at the origin of the framed 
auto-correlation and normalizing=fe£ the slope by the value at the origin. 

1 3 . (Currently Amended) =A The method according to claim 1, including 
estimating=lfe€s pitch of each=ssid frame. 

14. (Currently Amended) -A The method according to claim 13, wherein 
estimating=4he said pitch of each frame comprises: 

computing*© auto -correlation of each=s^4 frame; and 
estimating the pitch of each-sart frame by selecting a pitch period 

corresponding to a shift where4ke an auto-correlation coefficient associated with4fee 

said frame is relatively large. 

1 5. (Currently Amended) =A The method according to claim 1 9 including 
estimating*© pitch of each-said note between said adjacent initial breakpoints. 

16. (Currently Amended) -A. The method according to claim 15 ? wherein 
estimating=fee said pitch of each note between said initial breakpoints comprises: 

computing4fee auto-correlation of each=s^d frame; 

estimating the pitch of each-ssid frame by selecting a pitch period 
corresponding to a shift where*© an auto -correlation coefficient associated with=ft@ 
said frame is relatively large; and 
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averaging or taking the median of=4he pitch estimates of said frames between 
said adjacent breakpoints. 

17. (Currently Amended) =A The method according to claim 15, including 
associating each=said initial breakpoint with a confidence level, which is influenced by 
at least one of (a) A© degree in4ke change or rate of change of said pitch in=fefee said 
frames around=fe© said initial breakpoints, and (b)4he value of said SED indicator in 
the vicinity of=Ae said initial breakpoint, 

18. (Currently Amended) -A The method according to claim 17, wherein4fee 
said confidence level is further influenced by the energy level of said melody in the 
vicinity of=#*€ said initial breakpoint. 

19. (Currently Amended) =A The method according to claim 17> including 
eliminating from consideration initial breakpoints associated with confidence levels 
below a specified threshold, thereby identifying said breakpoints in said melody. 

20. (Currently Amended) ^ The method according to claim 19, including 
estimating^© said pitch and beat duration of each said note between said breakpoints. 

2 1 . (Currently Amended) =Az The method according to claim 1 5 wherein the 
melody is a voice-hummed melody composed of a series of uttered semi-vowels. 

22. (Currently Amended) Apparatus An apparatus for converting a digitized 
melody into a sequence of notes, comprising: 

means for segmenting said melody into a series of frames; 
means for computing a spectral energy distribution (SED) indicator for each 
frame; and 
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means for estimating initial breakpoints in said melody based on said SED, said 
notes being defined between adjacent initial breakpoints, 

23. (Currently Amended) Apparatus The apparatus according to claim 22, 
wherei n the value of said SED indicator has a value for a given frame that is relatively 
large if an energy distribution associated with said frame is concentrated in one or 
more specified frequency bands, 

24. (Currently Amended) -Appar-afe g The apparatus according to claim 23 , 
including a high pa ss filter for filtering said melody- ^th-a-high-pasg-fift eg prior to 
segmenting said melody into said frames, 

25. (Currently Amended) Apparatus The apparatus according to claim 24, 
wherein said energy distribution is determined from a normalized energy spectrum of 
said frame, 

26. (Currently Amended) Apparatus The apparatus according to claim 24, 
wherein said specified frequency band is the upper portion of a 0 to 4 kHz range, 

27. (Currently Amended) A method for converting a digitized melody into a 
sequence of notes, comprising: 

segmenting said melody into a series of frames; 

computmg4h# auto-correlation of each aaid frame; 

estimating=the pitch of each-saW frame based on (i) a pitch period 
corresponding to a shift where4he an auto-correlation coefficient associated with4he 
said frame is relatively large and (ii)=fee closeness of=fte pitch estimate to estimates in 
at least on e or more adjacen t frames frame ; and 

estimating breakpoints in said melody based on changes in said pitch estimates, 
said notes being defined between adjacent breakpoints. 
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28. (Currently Amended) -A The method according to claim 27, wherein 
including estimating said breakpoints arc estimated based on=s rate of change of said 
pitch estimates. 

29. (Currently Amended) =A= The method according to claim 27, including 
filtering said melody with a band pass filter prior to segmenting=fee said melody into 
frames, 

30. (Currently Amended) =A The method according to claim 27 ? including 
estimating=4he said pitch of each note by selecting^© average or median pitch of=4ke 
said frames falling within a pair of said breakpoints. 

3 1 . (Currently Amended) -A The method according to claim 27, wherein=Ae 
said melody is a voice- hummed melody. 

32. (Currently Amended) Another aspect-of-the; invention pro video a A 
method for identifying breakpoints in a digitized melody, the method comprising: 

segmenting4fe€ said melody into a series of frames; 
computing4he auto-correlation of each frame; 

estimating=4hs pitch of each frame based on (i) a pitch period corresponding to 
a shift where4he an auto-correlation coefficient associated with=fee said frame is 
relatively large and (ii)=fee closeness of=4fes pitch estimate to estimates in at least one 
or more adjacen t frames frame ; 

determining regions of said melody where pitch estimates are likely to be 
invalid; and 

identifying said breakpoints in=A# said melody based on transitions between 
frames having valid pitch estimates and transitions having invalid pitch estimates. 
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33. (Currently Amended) ^ The method according to claim 32, wherein 
including estimating said breakpoints ' are -estimated based on=^ rate of change of said 
pitch estimates. 

34. (Currently Amended) ^ The method according to claim 32, including 
filtering said melody with a band pass filter prior to segmenting-^© said melody into 
frames. 

35. (Currently Amended) =A The method according to claim 32, including 
estimating=fee said pitch of each note by selecting=4he average or median pitch of=4fee 
said frames falling within a pair of said breakpoints. 

36. (Currently Amended) =^ The method according to claim 32, wherein4he 
said melody is a voice-hummed melody. 

37. (Currently Amended) -Apparatus An apparatus for converting a digitized 
melody into a sequence of notes, comprising: 

means for segmenting said melody into a series of frames; 

means for computing4fce auto -correlation of each=ssid frame; 

means for estimating*^ pitch of each=s*rid frame based on (i) a pitch period 
corresponding to a shift where4he an auto-correlation coefficient associated with4fee 
said frame is relatively large and (ii)4be closeness of4fee pitch estimate to estimates in 
at least on e or more adjacen t frames frame : 

means for determining regions of said melody where pitch estimates are likely 
to be invalid; and 

means for estimating breakpoints in said melody based on changes in said pitch 
estimates or transitions between frames having valid pitch estimates and frames 
having no pitch estimates, said notes being defined between adjacent breakpoints. 
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38. (Withdrawn) A method of retrieving at least one entry from a music 
database, wherein each said entry is associated with a sequence of pitches and beat 
durations, said method comprising: 

receiving a digitized representation of an input melody; 

identifying breakpoints in said melody in order to define notes therein, each 
said notes being delineated by adjacent breakpoints; 

assigning a confidence level to each note or each breakpoint ; determining a 
pitch and beat duration for each note of said melody; 

determining a score for each said entry based on a search which minimizes the 
cost of matching the pitches and beat durations of said melody and said entry, wherein 
said search considers at least one deletion or insertion error in a selected note of said 
melody and, in this event, penalizes the cost of matching based on the confidence 
level of the selected note or a breakpoint associated therewith; and 
presenting said at least one entry to a user based on its score. 

39. (Withdrawn) A method according to claim 38, wherein said pitches and 
beat durations are relative pitches and relative beat durations. 

40. (Withdrawn) A method according to claim 38, wherein the cost of 
matching a given note Xi of said melody with a given note Yj associated with said 
entry is: 

match _cos t{X i9 ¥j) = a YRFj -XRFj\+j3\yrTj -XRT\ where YRFj and 

YRTj respectively represent the relative pitch and relative beat duration of the note 
associated with said entry; XRFj and XRTj respectively represent the relative pitch 
and relative beat duration of the note associated with said melody ; and a and p are 
weights. 

4L (Withdrawn) A method according to claim 38, wherein: 

a confidence level is assigned to each note and each breakpoint; and 
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said search considers deletion and insertion errors for any given note of said 
melody and, in this event, penalizes the cost of matching based on the confidence 
level of the given note and the confidence level of a breakpoint associated with the 
given note. 



42. (Withdrawn) A method according to claim 41, wherein: 

X is a sequence of notes, Xj, of said melody, each X b having components XRF i5 
XRTj XICONi, and XDCONi which respectively represent the relative pitch, relative 
beat duration, confidence level of the breakpoint and confidence level of the note 
associated with said melody; 

Y is a sequence of notes, Yj, of said entry, each Yj having components YRFj 
and YRTj which respectively represent the relative pitch and relative beat duration of 
the note associated with said entry; 

X and Y form a matrix, and at a matching point (X,, Yj) said search seeks to 

identify a preceding set of notes ^X t _ Y _ k , Yj^ ) ( X^ { 5 Yj_ x _ k )} 0 < k < max k , which 
minimize a match cost defined as follows: 
if k = 0, a \yRFj - XRF U _ X ^^YRT^ - XRT t _ } j, 

else if k>0, 



a 



a 



YRF^XRF^ 



+/3 



YRTj^ - XRT h 



E (penalty for the (m+l) th insertion)*XICON H 



ATI = 0 



YRFj-v- k -XRF^ \^0\YRTj^ -XRT^ j+ (penalty for k deletions)*XDCON i _ 1 
where a and (3 are weights. 



43. (Withdrawn) Apparatus for retrieving at least one entry from a music 
database, wherein each said entry is associated with a sequence of pitches and beat 
durations, said apparatus comprising: 

means for receiving a digitized representation of an input melody; 
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a melody-to-note conversion subsystem for identifying breakpoints in said 
melody in order to define notes therein, said subsystem determining a pitch and beat 
duration for each note of said melody and associating each note or each breakpoint 
with a confidence level; 

a note-matching engine for determining a score for each said entry based on a 
search which minimizes the cost of matching the pitches and beat durations of said 
melody and said entry, wherein said search considers at least one deletion or insertion 
error in a selected note of said melody and, in this event, penalizes the cost of 
matching based on the confidence level of the selected note or a breakpoint associated 
therewith; and 

an output subsystem for presenting said at least one entry to a user based on its 

score. 

44, (Withdrawn) A method of retrieving at least one entry from a music 
database, wherein each said entry is associated with a sequence of pitches and beat 
durations, said method comprising: 

receiving a digitized representation of an input melody; 

identifying breakpoints in said melody in order to define notes therein, each 
said notes being delineated by adjacent breakpoints; 

associating a confidence level with each note pertaining to likelihood that said 
note contains a note insertion error; 

determining a pitch and beat duration for each note of said melody; 

determining a score for each said entry based on a search which minimizes the 
cost of matching the pitches and beat durations of said melody and said entry, wherein 
said search considers at least one insertion error in a selected note of said melody and, 
in this event, penalizes the cost of matching based on the confidence level associated 
with the selected note; and 

presenting said at least one entry to a user based on its score. 



Page 11 of 16 



Application No, 09/804,069 



Reply to Office Action 



45. (Withdrawn) A method of retrieving at least one entry from a music 
database, wherein each said entry is associated with a sequence of pitches and beat 
durations, said method comprising: 

receiving a digitized representation of an input melody; 
identifying breakpoints in said melody in order to define notes therein, each 
said notes being delineated by adjacent breakpoints; 

associating a confidence level with each note pertaining to likelihood that said 
note contains a note deletion error ; determining a pitch and beat duration for each 
note of said melody; 

determining a score for each said entry based on a search which minimizes the 
cost of matching the pitches and beat durations of said melody and said entry, wherein 
said search considers at least one deletion error in a selected note of said melody and, 
in this event, penalizes the cost of matching based on the confidence level associated 
with the selected note; and 

presenting said at least one entry to a user based on its score. 

46. (Withdrawn) A method for determining confidence levels for breakpoints 
or notes in a waveform representing a melody, the method comprising: 

segmenting the waveform into a series of frames, wherein adjacent breakpoints 
encompass one or more sequential frames; 

executing at least two of the following three steps, 

(a) computing a spectral energy distribution (SED) indicator for each frame, 

(b) estimating the pitch of each frame, and 

(c) determining the energy level of each frame, 

deriving the confidence levels based on at least two of the following three 
characteristics, (i) the SED indicator, (ii) changes in pitch, and (iii) the energy level 

47. (Withdrawn) A method according to claim 46, wherein the confidence 
level for a given breakpoint is computed as a weighted combination of at least two of 
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three numbers, the first number based on the value of the SED indicator in the vicinity 
of the given breakpoint, the second number being based on a change in pitch in the 
frames before and after the given breakpoint, and the third number being based on the 
energy level of the frames in the immediate vicinity of the breakpoint. 

48. (Withdrawn) A method according to claim 46, wherein the confidence 
level for a given note is computed as a weighted combination of at least two of three 
numbers, the first number based on the value of the SED indicator in the given note, 
the second number being based on the variation in pitch in the given note, and the 
third number being based on the energy level of the frames in the given note. 

49. (Withdrawn) A method for determining confidence levels for breakpoints 
or notes in a waveform representing a melody, the method comprising: 

segmenting the waveform into a series of frames, wherein adjacent breakpoints 
encompass one or more sequential frames; 

computing a spectral energy distribution (SED) indicator for each frame; 
estimating the pitch of each frame; and 

deriving the confidence levels based on the SED indicator and changes in pitch. 

50. (Withdrawn) A method according to claim 49, wherein the confidence 
level for a given breakpoint is computed as a weighted combination of a first number 
based on the value of the SED indicator in the vicinity of the given breakpoint and a 
second number based on a change in pitch in the frames before and after the given 
breakpoint. 

5 1 . (Withdrawn) A method according to claim 49, wherein the confidence 
level for a given note is computed as a weighted combination of a first number based 
on the value of the SED indicator within the given note and a second number based on 
the variation in pitch within the given note. 
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52. (Withdrawn) A method according to claim 49, wherein the value of the 
SED indicator for a given frame is relatively large if an energy distribution associated 
with the frame is concentrated in one or more specified frequency bands. 

53. (Withdrawn) A method according to claim 52, including filtering the 
melody with a high pass filter prior to segmenting the melody into frames. 

54. (Withdrawn) A method according to claim 53, wherein the energy 
distribution is determined from a normalized energy spectrum of the frame. 

55. (Withdrawn) A method according to claim 54, wherein the specified 
frequency band is in the upper portion of a 0-4kHz frequency range. 

56. (Withdrawn) A method for determining confidence levels for breakpoints 
or notes in a waveform representing a melody, the method comprising: 

segmenting the waveform into a series of frames, wherein adjacent breakpoints 
encompass one or more sequential frames; 

computing a spectral energy distribution (SED) indicator for each frame; 
determining the energy level of each frame; and 

deriving the confidence levels based on the SED indicator and the energy level. 

57. (Withdrawn) A method according to claim 56, wherein the confidence 
level for a given break point is computed as a weighted combination of a first number 
based on the value of the SED indicator in the vicinity of the given breakpoint and a 
second number based on the energy level of the frame in the immediate vicinity of the 
breakpoint. 
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58. (Withdrawn) A method according to claim 56, wherein the confidence 
level for a given note is computed as a weighted combination of a first number based 
on the value of the SED indicator in given note and a second number based on the 
energy level of the frames in the given note. 

59. (Withdrawn) A method according to claim 56, wherein the value of the 
SED indicator for a given frame is relatively large if an energy distribution associated 
with the frame is concentrated in one or more specified frequency bands. 

60. (Withdrawn) A method according to claim 59, including filtering the 
melody with a high pass filter prior to segmenting the melody into frames. 

61. (Withdrawn) A method according to claim 60, wherein the energy 
distribution is determined from a normalized energy spectrum of the frame, 

62. (Withdrawn) A method according to claim 61, wherein the specified 
frequency band is the upper portion of a 0-4kHz frequency range. 
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